Composing Low-Overhead Scheduling Strategies for Improving Performance of Scientific Applications
نویسندگان
چکیده
Many different sources of overheads impact the efficiency of a scheduling strategy applied to a parallel loop within a scientific application. In prior work, we handled these overheads using multiple loop scheduling strategies, with each scheduling strategy focusing on mitigating a subset of the overheads. However, mitigating the impact of one source of overhead can lead to an increase in the impact of another source of overhead, and vice versa. In this work, we show that in order to improve efficiency of loop scheduling strategies, one must adapt the loop scheduling strategies so as to handle all overheads simultaneously. To show this, we describe a composition of our existing loop scheduling strategies, and experiment with the composed scheduling strategy on standard benchmarks and application codes. Applying the composed scheduling strategy to three MPI+OpenMP scientific codes run on a cluster of SMPs improves performance an average of 31% over standard OpenMP static scheduling.
منابع مشابه
Minimization of Memory Access Overhead for Multi-dimensional Dsp Applications via Multi-level Partitioning and Scheduling
Massive uniform nested loops are broadly used in multi-dimensional DSP applications. Due to the large amount of data handled by such applications, the optimization of data accesses by fully utilizing the local memory and minimizing communication overhead is important in order to improve the overall system performance. Most of the traditional partition strategies do not consider the eeect of dat...
متن کاملWiSeDB: A Learning-based Workload Management Advisor for Cloud Databases
Workload management for cloud databases deals with the tasks of resource provisioning, query placement, and query scheduling in a manner that meets the application’s performance goals while minimizing the cost of using cloud resources. Existing solutions have approached these three challenges in isolation while aiming to optimize a single performance metric. In this paper, we introduce WiSeDB, ...
متن کامل\threads: a System for the Support of Concurrent Programming". Technical Report
Many parallel applications are implemented using lightweight thread packages. The low overhead associated with user-level thread management encourages programmers to use threads to exploit ne-grain parallelism in an application. Although the overhead of explicit thread management can be very small, there is other overhead associated with lightweight threads: the time required to load data into ...
متن کاملMulti-level partitioning and scheduling under local memory constraint
Massive uniform nested loops are broadly used in scientiic and DSP applications. Due to the large amount of data handled by such applications, the optimization of data accesses by fully utilizing the local memory and minimizing communication overhead is important in order to improve the overall system performance. Most of the traditional partition strategies do not consider the eeect of data ac...
متن کاملAdaptive Shutdown Scheduling Strategies in Chip-Multiprocessor Platforms for Future Mobile Terminals
The latest process technology improvements has enabled the presence of increasingly powerful mobile terminals. It is likely that future generations will have similar functionality and flexibility as todays high-performance workstations, and thus the performance requirements as such. We propose an adaptive chip-multiprocessor architecture to provide performance at a low power consumption cost, w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015